arXiv:1502.00409v1 [math.CO] 2 Feb 2015


Decomposing a Graph Into Expanding Subgraphs 


Guy Moshkovitz* Asaf Shapira^ 


Abstract 

A paradigm that was successfully applied in the study of both pure and algorithmic problems 
in graph theory can be colloquially summarized as stating that any graph is close to being the 
disjoint union of expanders. Our goal in this paper is to show that in several of the instantiations 
of the above approach, the quantitative bounds that were obtained are essentially best possible. 
Three examples of our results are the following: 

• A classical result of Lipton, Rose and Tarjan from 1979 states that if $\mathcal{F}$ is a hereditary
family of graphs and every graph in $\mathcal{F}$ has a vertex separator of size $n/(\log n)^{1+o(1)}$, then
every graph in $\mathcal{F}$ has $O(n)$ edges. We construct a hereditary family of graphs with vertex
separators of size $n/(\log n)^{1-o(1)}$ such that not all graphs in the family have $O(n)$ edges.

• Motivated by the Unique Games Conjecture, Trevisan and Arora-Barak-Steurer showed 
that given a graph G, one can remove only 1% of G’s edges and thus obtain a graph in 
which each connected component has good expansion properties. We show that in both 
of these decomposition results, the expansion properties they guarantee are (essentially) 
best possible even when one is allowed to remove 99% of G’s edges. In particular, our 
results imply that the eigenspace enumeration approach of Arora-Barak-Steurer cannot 
give (even quasi-) polynomial time algorithms for unique games. 

• Sudakov and the second author have recently shown that every graph with average degree
$d$ contains an $n$-vertex subgraph with average degree at least $(1-o(1))d$ and vertex
expansion $1/\log^{1+o(1)} n$. We show that one cannot guarantee a better vertex expansion even
if allowing the average degree to be $O(1)$.

The above results are obtained as corollaries of a new family of graphs, which we construct 
by picking random subgraphs of the hypercube, and analyze using (simple) arguments from the 
theory of metric embedding. 


1 Introduction 

In recent years a certain paradigm has emerged which roughly says that any graph is close to 
being a vertex-disjoint union of expanders. Unlike the $\varepsilon$-regular partitions of Szemerédi [30] and
the cut decompositions of Frieze-Kannan [9], which are relevant only for dense graphs (i.e., with
$\Omega(n^2)$ edges), this paradigm is applicable for graphs of arbitrary density. Generally speaking, its
usefulness stems from the fact that it allows for a divide-and-conquer approach, in essence reducing
a problem on general graphs to the special case of expander graphs. 

For algorithmic problems, this paradigm has been useful, for example, in designing approximation
algorithms related to the Unique Games Conjecture [14], as in the seminal work of
Arora-Barak-Steurer [3] and of Trevisan [31]. It has also seen applications in property-testing algorithms [7, 10],
as well as in data structure design [25]. In graph theory, instantiations of this paradigm include
the theorem of Lipton-Tarjan-Rose [22] on vertex separators in hereditary families, the results of
Linial-Saks [21] and Leighton-Rao [19] on low-diameter decompositions, as well as results in the
field of graph minors [17, 27] and cycle packing [6]. See Section 2 for some further discussion.

We note that in all applications of the regularity lemmas [9, 30] one uses the same notion of
expansion¹. On the other hand, when dealing with sparse graphs and applying the above-mentioned
paradigm, each application calls for a different notion of expansion. Our main goal in this paper
is to show that in several of the above-mentioned applications, the different notions of expansion
that were used are quantitatively best possible.

As it turns out, all our results can be deduced from a construction of a new family of graphs
(stated in Theorem 1), whose main property is that every subgraph of it has a small edge separator, 
which in particular implies small vertex/edge expansion. We next recall some basic definitions. 

Definition 1.1 (Edge/Vertex Separator). An edge separator in a graph $G = (V,E)$ is a set of
edges whose removal disconnects $G$ into two (not necessarily connected) subgraphs, each on at most
$\frac{2}{3}|V|$ vertices. We write $\mathrm{sep}_E(G)$ for the minimum cardinality of an edge separator in $G$.

A vertex separator is defined in the same way, only now vertices are removed rather than edges.
We write $\mathrm{sep}_V(G)$ for the minimum cardinality of a vertex separator in $G$.

Note that $\mathrm{sep}_V(G) \le \mathrm{sep}_E(G)$, simply by taking an arbitrary endpoint of each separator edge.
The girth of a graph is the minimum length of a cycle in it. We henceforth write $\log(\cdot)$ for $\log_2(\cdot)$.
Our main technical result in this paper, stated next, implies that there exist graphs with a
super-linear number of edges, whose girth is $(\log n)^{1-o(1)}$ and whose $t$-vertex subgraphs all have
expansion at most $(\log t)^{-1+o(1)}$.

Theorem 1. For any n, k with 2 < k < ^ log log n there is an n-vertex graph G = Gn,k satisfying: 

i. G has average degree at least k and maximum degree at most 6k. 

ii. $G$ has girth at least $\log n/(6k)^2$.

iii. Every $t$-vertex subgraph $H$ of $G$ with $t \ge \log n/(6k)^2$ satisfies

$$\mathrm{sep}_V(H) \le \mathrm{sep}_E(H) \le \frac{t}{\log t}\cdot(\log\log t)^2 . \qquad (1)$$

Let us briefly explain why the quantitative estimates in Theorem 1 are (essentially) best possible.
First, observe that for subgraphs $H$ of $G_{n,k}$ in Theorem 1 on fewer than $\log n/(6k)^2$ vertices, we of
course have a much stronger bound on $\mathrm{sep}_V(H)$ and $\mathrm{sep}_E(H)$ than the one stated in (1) for larger
subgraphs. To see this, one just has to recall the following well-known fact.

¹In the setting of the regularity lemma, expansion is referred to as being $\varepsilon$-regular.
Fact 1.2. Every tree $T$ satisfies $\mathrm{sep}_V(T) \le 1$.

So item (ii) in Theorem 1 implies that subgraphs $H$ of $G_{n,k}$ on fewer than $\log n/(6k)^2$ vertices
satisfy $\mathrm{sep}_V(H) \le 1$ and thus also $\mathrm{sep}_E(H) \le \Delta(H) \le \Delta(G)$ $(\le \log\log n)$.² Note that the bound
on $\mathrm{sep}_E(H)$ cannot be improved by much beyond $\Delta(G)$ since $G$ contains $K_{1,\Delta(G)}$ (the star with
$\Delta(G)$ leaves) as a subgraph, and it is not hard to see that $\mathrm{sep}_E(K_{1,\Delta}) \ge \Delta/3$. Second, as we
discuss in Section 6, any family of graphs satisfying item (iii) (or variants of it) contains at most
$n(\log\log n)^{O(1)}$ edges, so one cannot construct graphs as above with average degree much larger
than $\log\log n$. Finally, it follows from Theorem 6 below that any graph of average degree at least
3 has a subgraph that is not a tree and does not contain a separator of size much smaller than the
one in (1), so our estimate on the size of the separator in item (iii) cannot be reduced by more
than some $\log\log t$ terms. Actually, the proof of Theorem 1 shows that in (1) we can replace the
$(\log\log t)^2$ term by $(\log\log t)^{1+o(1)}$, but we chose to use the simpler/cleaner expression.
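As a small sanity check of Definition 1.1 and of the star example above, the following sketch computes separators by exhaustive search. It is only an illustration under our own assumptions: the helper names (`components`, `balanced`, `sep_E`, `sep_V`, `star`) are hypothetical, and the search is exponential, so it is meant for very small graphs only.

```python
from itertools import combinations

def components(vertices, edges):
    """Connected components (as sets) of the graph (vertices, edges)."""
    adj = {v: set() for v in vertices}
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)
    seen, comps = set(), []
    for s in vertices:
        if s in seen:
            continue
        comp, stack = {s}, [s]
        while stack:
            x = stack.pop()
            for y in adj[x]:
                if y not in comp:
                    comp.add(y)
                    stack.append(y)
        seen |= comp
        comps.append(comp)
    return comps

def balanced(sizes, n):
    """Can the components be grouped into two sides, each on at most 2n/3 vertices?"""
    total, reachable = sum(sizes), {0}
    for s in sizes:
        reachable |= {r + s for r in reachable}
    return any(3 * r <= 2 * n and 3 * (total - r) <= 2 * n for r in reachable)

def sep_E(vertices, edges):
    """Minimum size of an edge separator, by brute force."""
    n = len(vertices)
    for size in range(len(edges) + 1):
        for removed in combinations(edges, size):
            rest = [e for e in edges if e not in removed]
            if balanced([len(c) for c in components(vertices, rest)], n):
                return size

def sep_V(vertices, edges):
    """Minimum size of a vertex separator, by brute force."""
    n = len(vertices)
    for size in range(n + 1):
        for removed in combinations(vertices, size):
            gone = set(removed)
            keep = [v for v in vertices if v not in gone]
            rest = [(u, v) for u, v in edges if u not in gone and v not in gone]
            if balanced([len(c) for c in components(keep, rest)], n):
                return size

def star(leaves):
    """The star K_{1,leaves}; vertex 0 is the centre."""
    return list(range(leaves + 1)), [(0, i) for i in range(1, leaves + 1)]

print(sep_V(*star(6)), sep_E(*star(6)))  # prints "1 3"
```

For the star with 6 leaves the vertex separator has size 1 (the centre), in line with Fact 1.2, while the edge separator has size 3, which is at least $\Delta/3 = 2$.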


1.1 Small set expansion 

Next we describe a strengthening of Theorem 1 which will be important for some of our applications. 
Notice that Theorem 1 gives a graph whose every subgraph has a large subset (i.e., consisting of at 
least 1/3 of its vertices) that does not expand. One may thus further ask for a graph that does not 
contain even small set expanders; that is, a graph whose every subgraph has a small non-expanding
set (i.e., consisting of $o(1)$-fraction of the vertices). The notion of small set expanders has recently
received much attention in theoretical computer science (see, e.g., [3, 28, 29]). For example, it 
is believed that solving unique games is intimately connected with the problem of finding small 
non-expanding sets. 

For a graph $H$ and a subset $\emptyset \ne A \subseteq V(H)$ we write $\partial_H(A)$ for the set of edges of $H$ with
precisely one endpoint in $A$, and $\phi_H(A)$ for the edge expansion of $A$ in $H$, that is,

$$\phi_H(A) = \frac{|\partial_H(A)|}{|A|} .$$
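The definition of edge expansion can be illustrated with a few lines of code. This is a minimal sketch; the function names `hypercube`, `edge_boundary` and `edge_expansion` are our own and not part of the paper. The example set is half of the 3-cube, i.e. all vertices whose first coordinate is 1.

```python
from itertools import product

def hypercube(d):
    """Adjacency lists of the d-cube: vertices are 0/1 tuples, edges flip one coordinate."""
    adj = {}
    for v in product((0, 1), repeat=d):
        adj[v] = [v[:i] + (1 - v[i],) + v[i + 1:] for i in range(d)]
    return adj

def edge_boundary(adj, A):
    """Edges with precisely one endpoint in A, listed as (inside, outside) pairs."""
    A = set(A)
    return [(u, w) for u in A for w in adj[u] if w not in A]

def edge_expansion(adj, A):
    """Size of the edge boundary of A divided by |A|, for a non-empty set A."""
    A = set(A)
    if not A:
        raise ValueError("A must be non-empty")
    return len(edge_boundary(adj, A)) / len(A)

Q3 = hypercube(3)
half = [v for v in Q3 if v[0] == 1]
print(edge_expansion(Q3, half))  # 4 boundary edges over 4 vertices: prints 1.0
```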


We have the following “small set” counterpart of Theorem 1. Note that here we in fact prove the
stronger property that one can essentially partition the subgraph into small non-expanding subsets. 


Theorem 2. Let $G = G_{n,k}$ be a graph from Theorem 1 with 2 ≤ k ≤ ^ log log log n. For every
$t$-vertex subgraph $H$ of $G$ with $t \ge \log n/(6k)^2$, and every $1/t \le \mu \le 2/3$, there are $\lceil 1/(4\mu) \rceil$ mutually
disjoint subsets $A_i \subseteq V(H)$ of size $\mu t/3 \le |A_i| \le \mu t$ satisfying

$$\phi_H(A_i) \le \frac{\log(1/\mu)}{\log t}\cdot(14\log\log t)^2 . \qquad (2)$$

Note that in Theorem 2 we can find, in any large subgraph $H$, subsets that simultaneously
have “measure” $o(1)$ and edge expansion $o(1)$. For example, by setting $\mu = 1/\log t$ in (2),
each of the subsets $A \subseteq V(H)$ produced by the theorem satisfies $|A|/t \le 1/\log t$ and $\phi_H(A) \le
(1/\log t)(14\log\log t)^3 = 1/(\log t)^{1-o(1)}$.

²$\Delta(\cdot)$ denotes the maximum degree of a graph.
Similarly to Theorem 1, the quantitative estimates in Theorem 2 are (essentially) best possible.
It follows from Theorem 6 below that any graph of average degree at least 3 has a $t$-vertex subgraph
whose every subset of measure $\mu$ has edge expansion at least $\Omega\big(\log(1/\mu)/(\log t\,(\log\log t)^2)\big)$. Thus, our estimate
of the edge expansion cannot be reduced by more than some $\log\log t$ terms. Finally, note that while
Theorem 2 only deals with subgraphs $H$ on at least $\log n/(6k)^2$ vertices, any smaller subgraph is
a tree (due to item (ii) in Theorem 1), implying that it trivially has small subsets with weak
expansion.

1.2 Paper overview 

In Section 2 we describe the main results of this paper, which are the applications of Theorems 1 
and 2, showing that many decomposition-type results that were used in different areas of research 
are essentially best possible. These include applications in approximation algorithms, property 
testing, extremal problems in graph theory as well as the nested dissection method. The proof of 
Theorem 1 is given in Section 3 (with some technical proofs deferred to Section 5) and the proof of
Theorem 2 is given in Section 4. Section 6 contains some concluding remarks and open problems.
Let us now give an overview of the proofs of our theorems.

The main idea of the proof of Theorem 1 is to start with the Boolean hypercube $Q_d$ and choose
an appropriate random subgraph $G$ of it. Using (simple) probabilistic and combinatorial arguments,
we show that this random subgraph $G$ satisfies items (i) and (ii) of Theorem 1. To prove the third
item, we first prove that all bounded degree subgraphs of $Q_d$ have small edge expansion. To this
end, we use an argument similar to the one used by Linial, London and Rabinovich [20] in the
setting of metric embedding. Then, we proceed to “boost” this fact in order to show that every
sufficiently large subgraph of $G$ actually has a small edge separator, thus establishing item (iii) of
Theorem 1. 

In order to prove Theorem 2, we first show that the hereditary nature of item (iii) of Theorem 1
can be used to iteratively find smaller and smaller subsets while controlling their expansion. This 
enables us to find a single small subset having small expansion. Next, we remove this subset and 
look for another small subset, this time having small expansion in the remaining subgraph. We 
continue in this manner until we get almost a partition of the vertex set. Finally, we show that 
many of the subsets we obtained have small expansion also in the original subgraph. 

2 Applications of Main Results 

2.1 Edge density of hereditary families with small separators 

A family of graphs is said to be hereditary if it is closed under taking induced subgraphs. The 
well-known Planar Separator theorem of Lipton-Tarjan [23] asserts that any $n$-vertex planar graph
has a vertex separator of cardinality at most $O(\sqrt{n})$. This influential result led to many extensions
for other hereditary families of graphs (such as minor-free families, see [2]). The notion of vertex
separator in graphs has since found numerous applications in studying both pure and algorithmic
problems in graph theory (see [26] and its references). One of the first applications of separators was
given by Lipton, Rose, and Tarjan [22] in their work on the nested dissection method. Intuitively,
their result states that the reason planar graphs (and more generally minor-free graphs) have
linearly many edges is that they have small separators. 

Theorem 3 ([22], Theorem 10). For every $\varepsilon > 0$ there is $C > 0$ for which the following holds.
Let $\mathcal{F}$ be a hereditary family of graphs such that every $n$-vertex graph $G \in \mathcal{F}$ satisfies $\mathrm{sep}_V(G) \le
n/(\log n)^{1+\varepsilon}$. Then every $n$-vertex graph in $\mathcal{F}$ has at most $Cn$ edges.

We mention that Fox and Pach [8] strengthened Theorem 3 by proving that even separators of
size $n/\log n(\log\log n)^{1+\varepsilon}$ guarantee³ that every graph in $\mathcal{F}$ has $O(n)$ edges.

Using Theorem 1 we next show that the separation requirement in Theorem 3 cannot be improved
much beyond $n/\log n$.

Corollary 2.1. There is a hereditary family of graphs $\mathcal{F}$ such that every $n$-vertex graph $G \in \mathcal{F}$
satisfies $\mathrm{sep}_V(G) \le (n/\log n)(\log\log n)^2$, yet there is no $C > 0$ such that every $n$-vertex graph in
$\mathcal{F}$ has at most $Cn$ edges.

Proof. Let $\mathcal{F}$ be the family of graphs defined as follows:

$$\mathcal{F} = \{G : G \text{ is an induced subgraph of } G_{n,\log\log n/648} \text{ for some } n\} ,$$

where $G_{n,\log\log n/648}$ are the graphs from Theorem 1. Note that $\mathcal{F}$ is, by definition, a hereditary
family of graphs. By items (ii) and (iii) of Theorem 1 (and Fact 1.2), every $n$-vertex graph $G \in \mathcal{F}$
satisfies $\mathrm{sep}_V(G) \le (n/\log n)(\log\log n)^2$, as required. On the other hand, by item (i) of Theorem 1,
the family $\mathcal{F}$ contains, for every large enough $n$, an $n$-vertex graph with $\Omega(n\log\log n)$ edges. This
completes the proof. □

2.2 Decomposing a graph into edge expanders 

Trevisan [31], in his work on approximating constraint satisfaction problems related to the Unique
Games Conjecture, gave a combinatorial algorithm for approximating unique games. The main 
component of the algorithm is a decomposition of any graph into disjoint expanders. Given the 
graph underlying the unique game input, the algorithm proceeds to solve the problem on each 
expander separately, which, as Trevisan shows, can be done in polynomial time. We note that we 
will revisit this paradigm for solving unique games in Section 2.3, where we discuss the well-known 
Arora-Barak-Steurer [3] algorithm. We begin with the formal definitions.

A (linear) unique game is a system of linear equations of the form $x_i - x_j = c_{ij} \pmod q$. Let
$\mathrm{UG}_q(\delta)$ denote the computational problem of deciding, given a unique game with equations modulo
$q$, whether there is an assignment satisfying at least a $(1-\delta)$-fraction of the equations or rather
every assignment satisfies at most a $\delta$-fraction of the equations (we are guaranteed that one of the
two cases holds). The influential Unique Games Conjecture⁴ of Khot [14] postulates that $\mathrm{UG}_q(\delta)$
is NP-hard for every $0 < \delta < 1/2$ and sufficiently large $q$. Trevisan’s algorithm efficiently solves a
weaker version of $\mathrm{UG}_q(\delta)$ (and therefore does not refute the Unique Games Conjecture).

A graph $G = (V,E)$ is said to have edge expansion $\alpha$ if every subset $S \subseteq V$ with $|S| \le |V|/2$
satisfies $|\partial(S)| \ge \alpha|S|$ (recall that the edge boundary $\partial(S)$ of $S$ is the set of edges with precisely

³The result of [8] actually shows that having separators of size $n/\log n\log\log n(\log\log\log n)^2$ suffices.

⁴The version stated here was shown in [15] to be equivalent to Khot’s original formulation in [14].
one endpoint in $S$). Trevisan proved the following decomposition theorem for arbitrary graphs.
Roughly speaking, it asserts⁵ that, given any graph, one can remove few of the edges in order to
disconnect it into disjoint graphs of high expansion⁶.

Theorem 4 ([31], Lemma 10). Let $0 < \varepsilon \le 1/2$. From any $n$-vertex graph one can remove at most
an $\varepsilon$-fraction of the edges to obtain a graph whose every connected component has edge expansion
at least ejVl log n. 

Using Theorem 1 we show in Corollary 2.3 below that, already for $\varepsilon = 1/2$, the expansion
guarantee in Theorem 4 cannot be improved much beyond $1/\log n$. In fact, we show that the
same conclusion holds even when $\varepsilon = 1 - o(1)$, that is, even if one is allowed to remove all but a
$o(1)$-fraction of the edges of the graph! To this end, we will first need the following observation
regarding the graphs $G_{n,k}$ from Theorem 1.

Claim 2.2. Any subgraph of $G_{n,k}$ with average degree at least 4 has at least $n^{1/(72k^2)}$ vertices.

Proof. Let $H$ be a $t$-vertex subgraph of $G$ with average degree at least 4. Recall the well-known fact
that the girth of a $t$-vertex graph of average degree at least 4 is at most $2\log t$ (in fact, average degree
at least 3 suffices, see [1]). Item (ii) in Theorem 1 thus implies $\log n/(6k)^2 \le 2\log t$, completing
the proof. □

Corollary 2.3. Let $G = G_{n,k}$ be a graph from Theorem 1. Any subgraph of $G$ with average degree
at least 4 has a connected component with edge expansion at most $(1/\log n)(15k\log\log n)^2$.

Proof. Consider a subgraph of $G$ with average degree at least 4, and note that it has a connected
component $H$ with average degree at least 4. Writing $t$ for the number of vertices of $H$, Claim 2.2
implies that $\log t \ge \log n/72k^2$. It follows from item (iii) in Theorem 1 that the edge expansion of
$H$ is at most $(t/\log t)(\log\log t)^2 \cdot (3/t) \le (1/\log n)(15k\log\log n)^2$. □

By Corollary 2.3, even after removing a fraction of $1 - 4/k$ of the edges of $G_{n,k}$, the remaining
graph has a connected component with edge expansion at most $O((1/\log n)(k\log\log n)^2)$. So
for example, taking $k = 8$ gives a bounded-degree graph such that even after removing up to
half of its edges, the remaining graph has a connected component with edge expansion at most
$O((1/\log n)(\log\log n)^2)$. At the other extreme, if $k = \log\log n/648$ then even after removing
a $(1-o(1))$-fraction of the edges, the remaining graph has a connected component with edge
expansion at most $(1/\log n)(\log\log n)^4$.

2.3 Threshold-rank decomposition 

Recall that in Section 2.2 we discussed a paradigm for solving unique games by decomposing any
graph into expanding subgraphs. The same paradigm was used in the breakthrough paper of Arora,
Barak and Steurer [3] which gave the first algorithm to come close to refuting the Unique Games
Conjecture, solving $\mathrm{UG}_q(\delta)$ in subexponential time exp(q · (here $n$ is the number of variables).
The algorithm in [3] employed the eigenspace enumeration method where, unlike Trevisan’s

⁵We note that prior to [31], Goldreich and Ron [10] have implicitly proved a result of the same spirit, whose exact
quantitative properties are somewhat more complicated to state. See also [13].

⁶Trevisan originally used the notion of conductance; our lower bound applies even for edge expansion.
decomposition, the pseudorandom property which the connected components must satisfy depends
on the entire spectrum of their adjacency matrix. As in Section 2.2, we prove that one cannot 
substantially improve the obtained decomposition. Formal definitions follow. 

For a $d$-regular graph $G$, denote by $\mathrm{rank}_\tau(G)$ the number (with multiplicities) of eigenvalues $\lambda$
of the adjacency matrix of $G$ satisfying $|\lambda| \ge \tau d$. Let $R_n(\eta,\varepsilon)$ denote the minimum integer such
that for any $n$-vertex graph $G$, one can remove at most an $\varepsilon$-fraction of its edges so that each
connected component $H$ of the new graph satisfies $\mathrm{rank}_{1-\eta}(H^*) \le R_n(\eta,\varepsilon)$, where $H^*$ is obtained
from $H$ by adding self-loops to make it $\Delta(G)$-regular. The main result of [3] can be summarized
as follows.

Theorem 5 ([28], Theorem 5.6). There are absolute constants $c, c' > 0$ such that:

i. For every $0 < \eta, \varepsilon < 1$,

Moreover, a decomposition achieving this bound can be found in polynomial time.⁷

ii. $\mathrm{UG}_q(\delta)$ can be solved in time $\exp(q \cdot R_n(O(\delta), c'))$.
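To make the notion of threshold rank concrete, here is a minimal sketch that evaluates it on the $n$-cycle, a 2-regular graph whose adjacency eigenvalues are known in closed form ($2\cos(2\pi j/n)$ for $j = 0, \ldots, n-1$). The cycle example and the function names are our own choice for illustration and do not appear in the paper; the small tolerance `tol` only guards against floating-point rounding.

```python
import math

def cycle_eigenvalues(n):
    """Adjacency eigenvalues of the n-cycle, sorted in decreasing order."""
    return sorted((2 * math.cos(2 * math.pi * j / n) for j in range(n)), reverse=True)

def threshold_rank(eigenvalues, d, tau, tol=1e-9):
    """Number of eigenvalues (with multiplicity) of absolute value at least tau * d."""
    return sum(1 for lam in eigenvalues if abs(lam) >= tau * d - tol)

eigs = cycle_eigenvalues(12)
print(threshold_rank(eigs, d=2, tau=0.8))  # counts the eigenvalues 2, -2 and +-sqrt(3) (twice each)
```

On the 12-cycle with $\tau = 0.8$ the threshold is $1.6$, so exactly the six eigenvalues $2$, $\sqrt{3}$, $\sqrt{3}$, $-\sqrt{3}$, $-\sqrt{3}$, $-2$ are counted.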

The first step in the algorithm of [3] (item (i) in Theorem 5 which, as the authors themselves
write, is the main component of the algorithm) is the efficient decomposition of any graph, by
removing few of its edges, into vertex-disjoint subgraphs of small threshold rank. This decomposition
is applied on the graph underlying the input unique game, and then the problem is solved on
each connected component. The second step (item (ii) in Theorem 5) follows by applying, for each
connected component, a brute force enumeration over the eigenspaces corresponding to its large
eigenvalues. The second step is inspired by works of Kolla and Tulsiani [4] and Kolla [16], who
previously used eigenspace enumeration to solve unique games on Cayley graphs. Combining the
two items of Theorem 5 shows that the algorithm of [3] runs in time exp(q ·

Note that in order to obtain a faster algorithm using the eigenspace enumeration method, one
must improve the bound on $R_n(\eta,\varepsilon)$ in Theorem 5. Up to the present work it was conceivable⁸
that for example $R_n(\eta,\varepsilon) \le (\log n)^{O(1)}$, say for $\varepsilon \ge \varepsilon_0$. Such a decomposition would have implied
a quasipolynomial time algorithm for $\mathrm{UG}_q(\delta)$, practically refuting⁹ the Unique Games Conjecture.

As we prove below, the bound on $R_n(\eta,\varepsilon)$ in Theorem 5 is essentially tight. Namely, we show
that one can derive from Theorem 2 a nearly polynomial lower bound on $R_n(\eta,\varepsilon)$. This essentially
eliminates the possibility of disproving the Unique Games Conjecture using the eigenspace
enumeration approach of [3]. See Section 6 for more details on this. We note that our lower bound
applies even when $\varepsilon = 1 - o(1)$, that is, even when one is allowed to remove all but a $o(1)$-fraction of
the graph’s edges. Naturally, the bound holds regardless of the running time of the decomposition
algorithm.

⁷So for example, for every $d$-regular $n$-vertex graph $G$, one can efficiently remove at most (say) 1% of the edges
of $G$ so that for each connected component in the resulting graph, its “regularized” adjacency matrix has at most
$n^{O(\eta^{1/3})}$ eigenvalues larger than $(1-\eta)d$.

⁸The main tool used in [3] in order to prove Theorem 5 is a higher-order Cheeger-type inequality. It was recently
shown in [5] that this inequality is essentially best possible, but to the best of our knowledge, this does not rule out
the possibility of improving Theorem 5 by other means.

⁹Say, under the so-called Exponential Time Hypothesis.
Corollary 2.4. For every $n \ge n_0$, $0 < \eta < 1$, and $\varepsilon \le 1 - 3000/\log\log\log n$, we have

Rn{r],e) > _ 

For the proof of Corollary 2.4 we will need the “easy” direction of the higher-order Cheeger 
inequality that first appeared in [18]. For completeness, we give a short and simple proof in
Section 5. 

Lemma 2.5. Let $G = (V,E)$ be a $d$-regular (multi-)graph¹⁰, and let $\lambda_1 \ge \lambda_2 \ge \cdots \ge \lambda_{|V|}$ be the
eigenvalues of the adjacency matrix of $G$. Then for every $k$,

$$\frac{d - \lambda_k}{2} \le \min_{S_1,\ldots,S_k}\ \max_{1 \le i \le k} \phi_G(S_i) ,$$

where the minimum is over all collections of $k$ mutually disjoint, non-empty subsets of $V$.
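The inequality of Lemma 2.5 can be checked numerically on a toy example. The sketch below is our own illustration (not taken from the paper): it uses the $n$-cycle, whose adjacency eigenvalues are $2\cos(2\pi j/n)$, and the particular collection of $k$ disjoint arcs of $\lfloor n/k \rfloor$ consecutive vertices each. Since the lemma bounds $(d-\lambda_k)/2$ by the minimum over all collections, the inequality must in particular hold for this one.

```python
import math

def cycle_eigenvalues(n):
    """Adjacency eigenvalues of the n-cycle, sorted in decreasing order."""
    return sorted((2 * math.cos(2 * math.pi * j / n) for j in range(n)), reverse=True)

def arc_expansion(n, size):
    """Edge expansion of `size` consecutive vertices on the n-cycle (n >= 3)."""
    boundary = 0 if size == n else 2   # an arc is cut off by exactly two edges
    return boundary / size

def lemma_holds_on_cycle(n, k, tol=1e-9):
    """Check (d - lambda_k)/2 <= max_i phi(S_i) for k disjoint arcs of size n // k."""
    lam = cycle_eigenvalues(n)
    lhs = (2 - lam[k - 1]) / 2              # d = 2 for the cycle
    rhs = arc_expansion(n, n // k)          # all k arcs have the same expansion
    return lhs <= rhs + tol

print(all(lemma_holds_on_cycle(12, k) for k in range(1, 13)))
```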

Proof of Corollary 2.4. Let $G = G_{n,k}$ be a graph from Theorem 1. We will show that any subgraph
of $G$ with average degree at least 4 has a connected component $H$ satisfying

$$\mathrm{rank}_{1-\eta}(H^*) \ge \frac{1}{4}\, n^{\eta/(k(200\log\log n)^2)} . \qquad (3)$$

Setting $k = \log\log\log n/648$, this would mean that removing an $\varepsilon$-fraction of the edges of $G$, even
with $\varepsilon$ as large as $1 - 4/k \ge 1 - 3000/\log\log\log n$, leaves a connected component $H$ that, for
every $n \ge n_0$, satisfies $\mathrm{rank}_{1-\eta}(H^*) \ge$ ”) . This would imply the desired lower bound on $R_n(\eta,\varepsilon)$.

Consider a subgraph of $G$ with average degree at least 4 and note that it has a connected
component $H$ with average degree at least 4. Writing $t$ for the number of vertices of $H$, Claim 2.2
implies that $t \ge n^{1/(72k^2)}$ $(\ge \log n/(6k)^2)$. Apply Theorem 2 on $H$ with $\mu = 1/t^{\eta k/(20\log\log t)^2}$. Note
that, as required by Theorem 2, we may assume $\mu \ge 1/t$ (as one may verify, recalling $\eta < 1$) and
$\mu \le 2/3$ (as otherwise $n^{\eta/(k(200\log\log n)^2)} \le 1/\mu < 3/2$, so (3) trivially holds). We thus obtain

$$b := \Big\lceil \frac{1}{4\mu} \Big\rceil \ge \frac{1}{4}\, t^{\eta k/(20\log\log t)^2} \ge \frac{1}{4}\, n^{\eta/(k(200\log\log n)^2)}$$

mutually disjoint and non-empty subsets of $H$ that each have edge expansion in $H$ at most
$(\log(1/\mu)/\log t)(14\log\log t)^2 \le \eta k/2$. Let $\lambda_1 \ge \lambda_2 \ge \cdots \ge \lambda_t$ be the eigenvalues of the
adjacency matrix of $H^*$ (recall that $H^*$ is obtained by adding to $H$ sufficiently many self-loops so as
to make it $\Delta(G)$-regular). Note that adding self-loops does not alter the expansion of any subset.
Therefore, Lemma 2.5 implies that $\Delta(G) - \lambda_b \le \eta k$. It follows that for every $1 \le i \le b$ we have
$\lambda_i \ge \lambda_b \ge \Delta(G) - \eta k \ge (1-\eta)\Delta(G)$. This shows that $\mathrm{rank}_{1-\eta}(H^*) \ge b$, as required in (3), and
therefore completes the proof. □

2.4 Finding a single vertex-expanding subgraph

For a graph $G$, a subset $S \subseteq V(G)$ is said to have vertex expansion $\alpha$ if $|N(S)| = \alpha|S|$, where $N(S)$
denotes the set of vertices outside of $S$ that have a neighbor in $S$. Motivated by certain extremal
problems related to graph minors, it was shown in [27] that every graph $G$ contains a subgraph $H$,
such that $H$ has good vertex expansion properties and almost the same average degree as $G$.

¹⁰By this we mean that we allow self-loops.




Theorem 6 ([27], Lemma 1.2). Let 0 < e < 2“®. Every graph G of average degree k contains a 
$t$-vertex subgraph $H$ with average degree at least $(1-\varepsilon)k$ such that every subset of $V(H)$ of size $\mu t$
with $1/t \le \mu \le 1/2$ has vertex expansion in $H$ at least $\varepsilon \cdot \log(1/\mu)/\log t(2\log\log t)^2$.

Theorem 2, together with item (ii) of Theorem 1 and Fact 1.2, yields the following corollary, 
which shows that the expansion guarantee in Theorem 6 above is essentially best possible. 

Corollary 2.6. Let $G$ be a graph from Theorem 2, and let $H$ be a $t$-vertex subgraph of $G$. Then
for every $1/t \le \mu \le 2/3$, there is a subset of $V(H)$ of size in $[\mu t/3, \mu t]$ whose vertex expansion in
$H$ is at most $(\log(1/\mu)/\log t) \cdot (14\log\log t)^2$.

2.5 Hyperfinite families of graphs 

A graph is said to be $(\varepsilon, q)$-hyperfinite if one can remove an $\varepsilon$-fraction of its edges and thus decompose
it into connected components of size at most $q$ each. A family of graphs is said to be hyperfinite
if there is a function $q$ such that for every $\varepsilon > 0$, every graph in the family is $(\varepsilon, q(\varepsilon))$-hyperfinite.
Hyperfinite families of graphs have been extensively studied in recent years, mainly because of their 
role in the theory of graph limits of sparse graphs (see [24] ). Motivated by certain questions related 
to the design of property-testing algorithms, it was shown in [7] that a hereditary family of graphs 
in which every graph has a small edge separator must be hyperfinite. More precisely, the following 
holds. 

Theorem 7 ([7], Corollary 3.2). Let $\mathcal{F}$ be a hereditary family of graphs such that every $n$-vertex
graph $G \in \mathcal{F}$ satisfies $\mathrm{sep}_E(G) \le n/\log n(\log\log n)^2$. Then, $\mathcal{F}$ is hyperfinite.

Using Theorem 1 we show that the edge-separation requirement in Theorem 7 cannot be improved
much beyond $n/\log n$.

Corollary 2.7. The graphs Gn,8 from Theorem 1 are not -hyperfinite. In particular, 

setting 

$$\mathcal{F}_8 = \{G : G \text{ is an induced subgraph of } G_{n,8} \text{ for some } n\}$$

we get a hereditary family of graphs that is not hyperfinite, despite the fact that every $n$-vertex
graph $G \in \mathcal{F}_8$ satisfies $\mathrm{sep}_E(G) \le (n/\log n)(\log\log n)^2 + 48$.

Proof. To see that the first assertion holds, note that after removing at most half of the edges of
$G_{n,8}$, we obtain a graph with average degree at least 4. This graph has a connected component
$H$ of average degree at least 4. By Claim 2.2, the number of vertices in $H$ is at least $n^{1/(72 \cdot 8^2)}$. As
to the second assertion of the corollary, note that the first assertion clearly means that $\mathcal{F}_8$ is not
hyperfinite. Also, note that the fact that $\mathrm{sep}_E(G) \le (n/\log n)(\log\log n)^2 + 48$ holds for every
$G \in \mathcal{F}_8$ follows from items (i), (ii) and (iii) of Theorem 1 together with Fact 1.2. □

3 Proof of Theorem 1 

As mentioned earlier, in order to construct the graphs $G_{n,k}$ of Theorem 1 we pick a suitable random
subgraph of the hypercube. The $d$-cube, denoted $Q_d$, is the graph with vertex set $\{0,1\}^d$ where two
vertices are adjacent if their corresponding vectors differ in exactly one coordinate. Note that $Q_d$ is
a $d$-regular graph on $2^d$ vertices. The proof will have three main steps, appearing in the following
three subsections. In the first subsection we define the graphs $G_{n,k}$ and prove that they satisfy the
first two properties of Theorem 1. In the second step we prove that large subgraphs of the cube do
not have good expansion properties. Finally, in the last subsection we combine the results from the
first two subsections to prove that the graphs $G_{n,k}$ also satisfy the third property in Theorem 1,
thus completing its proof.

3.1 Construction 

Our goal in this subsection is to prove the following lemma. 

Lemma 3.1. For any $d \in \mathbb{N}$ and $2 \le k \le d$ there is a subgraph $Q = Q_{d,k}$ of the $d$-cube, on all $2^d$
vertices, such that:

i. The average degree of $Q$ is at least $k$.

ii. The maximum degree of $Q$ is at most $3k$.

iii. The girth of $Q$ is at least $d/(3k)^2$.

For the proof of the above lemma we will need the following upper bound on the number of 
cycles of a given length in the cube. 

Claim 3.2. The number of cycles of length $2\ell$ in the $d$-cube is at most $2^d (d\ell)^\ell$.

Proof. We will show that the number of closed walks of length $2\ell$, starting and ending at a given
vertex, is at most $(d\ell)^\ell$; this would immediately imply the stated bound. We claim that each such closed
walk corresponds to a sequence $(x_1, \ldots, x_{2\ell}) \in [d]^{2\ell}$ with the property that $|\{i \in [2\ell] : x_i = t\}|$ is
even for every $t \in [d]$. To see this, recall that each vertex in the graph corresponds to an element of
$\{0,1\}^d$. The claim follows by considering, for each edge along a closed walk beginning (and ending)
with a given vertex, the unique index in which the bit “flips”. So all that is left is to bound from
above the number of sequences as above. Now, observe that all these sequences can be generated
by first partitioning the set of indices $\{1, \ldots, 2\ell\}$ into $\ell$ pairs and then assigning to each pair a
value from $\{1, \ldots, d\}$ (of course, this process will generate some sequences several times). Since the
number of ways one can pair the elements of $\{1, \ldots, 2\ell\}$ is given by

$$(2\ell - 1)!! = (2\ell - 1)(2\ell - 3) \cdots 3 \cdot 1 \le 2^\ell \ell!/2 \le 2^\ell (\ell/2)^\ell = \ell^\ell ,$$

we get that the number of sequences is bounded from above by $d^\ell \cdot \ell^\ell = (d\ell)^\ell$. □
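The closed-walk bound in the proof above is easy to test on small cubes. The following sketch is our own illustration; the function name `closed_walks` and the chosen ranges are assumptions. It counts closed walks exactly by tracking only the Hamming distance from the starting vertex, and compares the count with $(d\ell)^\ell$.

```python
def closed_walks(d, length):
    """Number of closed walks of the given length from a fixed vertex of the d-cube."""
    # counts[h] = number of walks so far that end at Hamming distance h from the start.
    counts = [0] * (d + 1)
    counts[0] = 1
    for _ in range(length):
        nxt = [0] * (d + 1)
        for h, c in enumerate(counts):
            if c == 0:
                continue
            if h > 0:
                nxt[h - 1] += c * h        # flip back one of the h differing bits
            if h < d:
                nxt[h + 1] += c * (d - h)  # flip one of the d - h agreeing bits
        counts = nxt
    return counts[0]

print(closed_walks(3, 4), (3 * 2) ** 2)  # prints "21 36"
for d in range(1, 7):
    for ell in range(1, 7):
        assert closed_walks(d, 2 * ell) <= (d * ell) ** ell
```

For instance, in the 3-cube there are 21 closed walks of length 4 from a given vertex, below the bound $(3 \cdot 2)^2 = 36$.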

To prove the existence of a graph as in Lemma 3.1 we use the so-called probabilistic “deletion 
method”. 

Proof of Lemma 3.1. Denote by $G'$ the random subgraph of the $d$-cube where each edge is
independently retained with probability $p = 3k/d$. Let the random variable $X$ count the number of
edges of $G'$. Then

$$\mathbb{E}[X] = p \cdot d2^{d-1} = 3k \cdot 2^{d-1} . \qquad (4)$$
Set $L = d/9k^2$ and let the random variable $Y$ count the number of cycles of length at most $L$ in
$G'$. By Claim 3.2,

$$\mathbb{E}[Y] \le \sum_{\ell=2}^{L/2} 2^d (d\ell)^\ell p^{2\ell} = 2^d \sum_{\ell=2}^{L/2} (p^2 d\ell)^\ell = 2^d \sum_{\ell=2}^{L/2} (\ell/L)^\ell \le 2^d \sum_{\ell=2}^{L/2} (1/2)^\ell \le 2^{d-1} . \qquad (5)$$

Let the random variable $Z$ count the total number of “excess” edges in $G'$, that is,

$$Z = \sum_{v \,:\, \deg(v) > 3k} (\deg(v) - 3k) .$$

For each vertex $v$, the random variable $\deg(v)$ follows the binomial distribution $B(d, 3k/d)$, so
$\mathbb{E}[\deg(v)] = 3k$. Note that $\sum_v |\deg(v) - 3k| = \sum_v (3k - \deg(v)) + 2Z$, which means $\mathbb{E}[Z] =
\frac{1}{2}\sum_v \mathbb{E}|\deg(v) - 3k|$. By Jensen’s inequality,

$$(\mathbb{E}|\deg(v) - 3k|)^2 \le \mathbb{E}[(\deg(v) - 3k)^2] = \mathrm{Var}[\deg(v)] \le 3k ,$$

implying that

$$\mathbb{E}[Z] = \frac{1}{2}\sum_v \mathbb{E}|\deg(v) - 3k| \le \sqrt{3k} \cdot 2^{d-1} . \qquad (6)$$

Combining (4), (5), and (6) we get

$$\mathbb{E}[X - Y - Z] \ge (3k - 1 - \sqrt{3k}) \cdot 2^{d-1} \ge k \cdot 2^{d-1} , \qquad (7)$$

where the last inequality can be easily checked to hold for any $k \ge 2$. Let $Q$ be obtained from $G'$
by removing an arbitrary edge from each cycle of length at most $d/9k^2$, as well as removing, for
each vertex $v$ with $\deg(v) > 3k$, arbitrary $\deg(v) - 3k$ adjacent edges. Clearly, $Q$ satisfies the last
two requirements in the statement. Moreover, we have from (7) that the expected average degree
of $Q$ is at least $k$. The existence of a subgraph of the $d$-cube as required immediately follows. □
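The two numerical facts used at the end of the proof, namely the per-vertex bound behind (6) and the last inequality in (7), can be checked directly. The sketch below is only an illustration with hypothetical helper names (`expected_excess`, `surplus`): it computes $\mathbb{E}[\max(\deg(v) - 3k, 0)]$ exactly from the binomial distribution $B(d, 3k/d)$ and compares it with $\sqrt{3k}/2$, and it evaluates $3k - 1 - \sqrt{3k}$ against $k$.

```python
import math

def expected_excess(d, k):
    """E[max(deg - 3k, 0)] for deg ~ Binomial(d, 3k/d), computed from the exact pmf."""
    p = 3 * k / d
    return sum(
        (j - 3 * k) * math.comb(d, j) * p**j * (1 - p) ** (d - j)
        for j in range(3 * k + 1, d + 1)
    )

def surplus(k):
    """The factor 3k - 1 - sqrt(3k) multiplying 2^(d-1) in inequality (7)."""
    return 3 * k - 1 - math.sqrt(3 * k)

for d, k in [(60, 2), (200, 5), (400, 8)]:
    # Per-vertex form of (6): the expected excess is at most sqrt(3k)/2.
    print(d, k, expected_excess(d, k) <= math.sqrt(3 * k) / 2)

# Last inequality in (7), checked for a range of k >= 2.
print(all(surplus(k) >= k for k in range(2, 101)))
```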

3.2 Expansion in the hypercube 

In this subsection we prove that large subgraphs of the $d$-cube are not good edge expanders. Recall that a graph $G = (V,E)$ is said to have edge expansion $\alpha$ if every subset $S \subseteq V$ with $|S| \le |V|/2$ satisfies $|\partial(S)| \ge \alpha|S|$ (the edge boundary $\partial(S)$ is the set of edges with exactly one vertex in $S$). Our goal in this subsection is to prove the following lemma.

Lemma 3.3. Any (not necessarily induced) subgraph of the $d$-cube with $t$ vertices and average degree $r$ has edge expansion at most $2r\log d/\log(t/2)$.

We first need the following easy claim, in which the distance between two vertices $u,v$ in a graph $G$, denoted $\delta_G(u,v)$, is the length of a shortest path connecting $u$ and $v$.

Claim 3.4. For every graph $G$ of maximum degree $\Delta > 1$, and every $t$-vertex subset $S \subseteq V(G)$, the average distance $\sum_{\{u,v\}\in\binom{S}{2}}\delta_G(u,v)/\binom{t}{2}$ is at least $\log(t/2)/(2\log\Delta)$.

Proof. Let $v \in S$. We claim that there are at least $t/2$ vertices in $S$ of distance at least $\ell = \log(t/2)/\log\Delta$ from $v$. Indeed, the number of vertices of distance at most $\ell-1$ from $v$ is at most $\sum_{i=0}^{\ell-1}\Delta^i < \Delta^{\ell} \le t/2$. It follows that the average distance is at least $\ell/2$. □
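
To make Claim 3.4 concrete, here is a small Python sketch (our own illustration; the function names are hypothetical) that computes the average pairwise distance of a vertex subset by breadth-first search and compares it with the bound $\log(t/2)/(2\log\Delta)$. It is run on the full 4-cube, where $\Delta = 4$ and $t = 16$; the graph is assumed to be connected.

```python
from collections import deque
from itertools import combinations
from math import log

def bfs_distances(adj, src):
    """Shortest-path distances from src to every reachable vertex."""
    dist = {src: 0}
    queue = deque([src])
    while queue:
        x = queue.popleft()
        for y in adj[x]:
            if y not in dist:
                dist[y] = dist[x] + 1
                queue.append(y)
    return dist

def average_distance(adj, subset):
    """Average of delta_G(u, v) over unordered pairs {u, v} of `subset` (G assumed connected)."""
    subset = list(subset)
    dist = {u: bfs_distances(adj, u) for u in subset}
    pairs = list(combinations(subset, 2))
    return sum(dist[u][v] for u, v in pairs) / len(pairs)

def claim_3_4_bound(t, max_degree):
    """Lower bound log(t/2) / (2 log Delta) from Claim 3.4 (any fixed log base)."""
    return log(t / 2) / (2 * log(max_degree))

d = 4
cube = {v: [v ^ (1 << i) for i in range(d)] for v in range(1 << d)}
avg = average_distance(cube, range(1 << d))
bound = claim_3_4_bound(1 << d, d)
```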

We are now ready to prove the main result of this subsection. 


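Proof of Lemma 3.3. Let $H = (S,E)$ be a $t$-vertex subgraph of the $d$-cube $Q_d$ of average degree $r$, and let $\alpha$ denote the edge expansion of $H$. We need to show that $\alpha \le 2r\log d/\log(t/2)$. Note that we may assume $\alpha > 0$ (i.e., that $H$ is connected), as otherwise there is nothing to prove. For each vertex $v \in S$, write $(v_1,\ldots,v_d) \in \{0,1\}^d$ for the corresponding binary vector. Notice $\delta_{Q_d}(u,v) = \sum_{i=1}^d|u_i-v_i|$. Observe that

\[ \frac{\sum_{\{u,v\}\in E}\delta_{Q_d}(u,v)}{\sum_{\{u,v\}\in\binom{S}{2}}\delta_{Q_d}(u,v)} = \frac{\sum_{i=1}^d\sum_{\{u,v\}\in E}|u_i-v_i|}{\sum_{i=1}^d\sum_{\{u,v\}\in\binom{S}{2}}|u_i-v_i|} = \frac{\sum_{i=1}^d x_i}{\sum_{i=1}^d y_i} \ge \min_{1\le i\le d}\frac{x_i}{y_i} , \tag{8} \]

where $x_i = \sum_{\{u,v\}\in E}|u_i-v_i|$ and $y_i = \sum_{\{u,v\}\in\binom{S}{2}}|u_i-v_i|$, and where the inequality holds for all non-negative reals $x_1,y_1,\ldots,x_d,y_d$. Let $i \in [d]$ achieve the minimum in (8) (which is well defined since $y_i \ne 0$ for some $i$), and set $T = \{v \in S : v_i = 1\}$. Note that since $y_i > 0$ we have $0 < |T| < |S|$ and that we can in fact assume that $0 < |T| \le |S|/2$ (otherwise we replace $T$ with $S \setminus T$). Noting that in (8) we have $x_i = |\partial T|$ and $y_i = |T||S\setminus T|$, we deduce that

\[ \frac{\sum_{\{u,v\}\in E}\delta_{Q_d}(u,v)}{\sum_{\{u,v\}\in\binom{S}{2}}\delta_{Q_d}(u,v)} \ge \frac{|\partial T|}{|T||S\setminus T|} \ge \min_{S'\subseteq S\,:\,0<|S'|\le|S|/2}\frac{|\partial S'|}{|S'||S\setminus S'|} \ge \frac{\alpha}{|S|-1} , \]

where the last inequality uses $|\partial S'| \ge \alpha|S'|$ and $|S\setminus S'| \le |S|-1$. Therefore, since every edge of $H$ is an edge of $Q_d$ (so $\delta_{Q_d}(u,v) = 1$ for $\{u,v\}\in E$) and $|E| = rt/2$,

\[ \frac{1}{\binom{t}{2}}\sum_{\{u,v\}\in\binom{S}{2}}\delta_{Q_d}(u,v) \le \frac{1}{\binom{t}{2}}\cdot\frac{|S|-1}{\alpha}\sum_{\{u,v\}\in E}\delta_{Q_d}(u,v) = \frac{2|E|}{\alpha|S|} = \frac{r}{\alpha} . \tag{9} \]

Applying Claim 3.4 with $G = Q_d$ and the set $S$, the left hand side of (9) is at least $\log(t/2)/(2\log d)$. The desired bound on $\alpha$ follows. □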
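
For very small instances the statement of Lemma 3.3 can be checked by brute force. The Python sketch below (an illustration under our own naming, not taken from the paper) computes the edge expansion of a graph by enumerating all subsets of at most half the vertices, and compares it with $2r\log d/\log(t/2)$ for the full 3-cube and for the star $K_{1,4}$ viewed as a (non-induced) subgraph of the 4-cube. The enumeration is exponential in $t$, so it is only meant for toy sizes.

```python
from itertools import combinations
from math import log

def edge_expansion(vertices, edges):
    """Brute-force min |boundary(S)| / |S| over nonempty S with |S| <= |V|/2."""
    vertices = list(vertices)
    best = float("inf")
    for size in range(1, len(vertices) // 2 + 1):
        for subset in combinations(vertices, size):
            s = set(subset)
            boundary = sum((u in s) != (v in s) for u, v in edges)
            best = min(best, boundary / size)
    return best

def lemma_3_3_bound(t, r, d):
    """Upper bound 2 r log d / log(t/2) of Lemma 3.3 (a ratio of logs, so base-free)."""
    return 2 * r * log(d) / log(t / 2)

# Full 3-cube: t = 8 vertices, average degree r = 3, ambient dimension d = 3.
cube = [(v, v | (1 << i)) for v in range(8) for i in range(3) if not v & (1 << i)]
alpha_cube = edge_expansion(range(8), cube)
bound_cube = lemma_3_3_bound(t=8, r=3, d=3)

# Star K_{1,4} inside the 4-cube: centre 0, leaves 1, 2, 4, 8.
star = [(0, 1 << i) for i in range(4)]
alpha_star = edge_expansion([0, 1, 2, 4, 8], star)
bound_star = lemma_3_3_bound(t=5, r=2 * len(star) / 5, d=4)
```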


3.3 Putting it all together 

To prove Theorem 1 we will show that the graphs constructed in Lemma 3.1 have the property that every subgraph $H$ has a small edge separator. To this end, we start by proving that every subgraph $H$ has a small edge expansion. In the proof we will consider two cases, depending on the number of vertices of $H$. If $H$ has at most $2^{d^{1/3}}$ vertices, then having high expansion implies (see Lemma 3.5 below) the existence of a cycle of length $O(d/k^2)$; however, such a short cycle does not exist in the graph. Otherwise, since the maximum degree of the graph is bounded, Lemma 3.3 implies that $H$ has small edge expansion. Finally, we boost this “hereditary” non-expansion property to construct a small separator in $H$ (see Lemma 3.6 below).

To execute the above proof strategy we will need the following two lemmas, whose proofs appear in Section 5. Recall that we use $\mathrm{sep}_E(G)$ to denote the size of the smallest edge separator in $G$.

Footnote (on the inequality in (8)): Indeed, denoting the minimum by $m$, we have $x_i \ge my_i$ for every $i$ (even if $y_i = 0$), so $\sum_i x_i \ge m\sum_i y_i$.
Lemma 3.5. Every connected $n$-vertex graph with edge expansion $\alpha$ ($> 0$) that is not a tree has girth at most $2(2/\alpha+1)(\ln(n)+1)+1$. In particular, if $\alpha \le 1$ then the girth is at most $12\ln n/\alpha$.

Lemma 3.6. Let $G$ be an $n$-vertex graph whose every $t$-vertex subgraph has expansion at most $f(t)$, for every $t \ge n/3$ and where $f : [n/3,n] \to \mathbb{R}^+$ is decreasing. Then $\mathrm{sep}_E(G) \le (2/3)nf(n/3)$.
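
The “in particular” part of Lemma 3.5 is a short calculation: for $\alpha \le 1$ we have $2/\alpha + 1 \le 3/\alpha$, and the additive terms are absorbed once $\ln n \ge 7/6$, that is, for $n \ge 4$. The Python sketch below (function names are ours) evaluates both bounds and checks numerically that the simplified one dominates on a grid of values; the case $n = 3$ is not covered by this check.

```python
from math import log

def girth_bound(n, alpha):
    """Girth bound of Lemma 3.5: 2 (2/alpha + 1) (ln n + 1) + 1."""
    return 2 * (2 / alpha + 1) * (log(n) + 1) + 1

def simplified_girth_bound(n, alpha):
    """Weaker form 12 ln(n) / alpha, stated in the lemma for alpha <= 1."""
    return 12 * log(n) / alpha

# Numerical check on a grid of n >= 4 and 0 < alpha <= 1.
for n in range(4, 2000):
    for alpha in (0.01, 0.1, 0.25, 0.5, 0.75, 1.0):
        assert girth_bound(n, alpha) <= simplified_girth_bound(n, alpha)
```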

We can finally give a proof for the main theorem. 

Proof of Theorem 1. First, suppose that $n$ is a power of 2, and write $n = 2^d$. Let $G = Q_{d,2k}$ be a subgraph of the $d$-cube as guaranteed by Lemma 3.1. For the rest of the proof set $t_0 = d/(6k)^2$. The assertion of Lemma 3.1 implies that $G$ has average degree at least $2k$, maximum degree at most $6k$, and girth at least $t_0$. So to complete the proof (for $n = 2^d$) we only need to establish item (iii) of Theorem 1. We will start by proving that every $t$-vertex subgraph of $G$ on at least $t_0$ vertices has edge expansion at most $\frac13(1/\log t)(\log\log t)^2$. We will then “boost” this fact using Lemma 3.6 and get that every $t$-vertex subgraph as above in fact has an edge separator of size at most $(t/\log t)(\log\log t)^2$, as required. Note that the statement of the theorem implies that
d > One consequence of this fact is that d > (logd)®. We will also use the fact that 

the theorem’s assumption that $k \le \log d/648$ together with the previous observation implies that $d^{1/3}/(6k)^2 \ge 12$ and that $18k \le t_0/(3\log t_0)$. Finally, suppose $H'$ is a subgraph of $G$ and that $H'$ is a forest. As $G$ has maximum degree at most $6k$, we have by Fact 1.2 and the above relation between $k$ and $t_0$ that $\mathrm{sep}_E(H') \le \Delta(H') \le 6k \le \frac19 t_0/\log t_0$. Hence, if $H'$ has $t \ge t_0$ vertices then its edge expansion is at most $(\frac19 t/\log t)/(\frac13 t) = \frac13(1/\log t)$.

Let then $H$ be a $t$-vertex subgraph of $G$ with $t \ge t_0$ vertices, and let us prove that the edge expansion of $H$ is at most $\frac13(1/\log t)(\log\log t)^2$. Since we have already established this fact when $H$ is a forest at the end of the previous paragraph, we assume for the rest of this paragraph that $H$ is not a forest. Suppose first that $t \le 2^{d^{1/3}}$. By Lemma 3.5, if the edge expansion of $H$ is at least $1/\log t$ then its girth is at most $12(\log t)^2 \le 12d^{2/3} \le d/(6k)^2$. However, $G$ does not contain a cycle this short. Therefore, the edge expansion of $H$ is indeed at most $\frac13(1/\log t)(\log\log t)^2$. Suppose now that $t > 2^{d^{1/3}}$. By Lemma 3.3 and the fact that the average degree of $H$ is at most $6k$, the edge expansion of $H$ is at most $24k\log d/\log t$. Thus, to prove our claim it suffices to show that $\sqrt{72k\log d} \le \log\log t$. This indeed follows from the theorem’s assumption that $k \le \log d/648$, since it implies that

\[ \sqrt{72k\log d} \le (\log d)/3 = \log(d^{1/3}) \le \log\log t . \]

Having established that every subgraph of $G$ on at least $t_0$ vertices has small edge expansion, we now wish to show that every such graph has a small separator. So let $H$ be an arbitrary $t$-vertex subgraph of $G$ with $t \ge t_0$. We apply Lemma 3.6 on the graph $H$ with the function $f(x) = \frac13(1/\log x)(\log\log x)^2$, noting that, as required by the lemma, every $t'$-vertex subgraph of $H$ with $t' \ge n/3$ ($\ge t_0$) has expansion at most $f(t')$, and $f : [n/3,n] \to \mathbb{R}^+$ is decreasing (here we use the fact that, as can be easily checked, $f(t)$ is decreasing for $t \ge 256$, and that $n/3 \ge 256$). We deduce that $\mathrm{sep}_E(H) \le (2/3)tf(t/3) \le (t/\log t)(\log\log t)^2$, as required by item (iii) in Theorem 1. This completes the proof of the theorem for the case $n = 2^d$.

Finally, let us consider the case of an arbitrary $n$, that is, not necessarily a power of 2. In this case we set $d = \lceil\log n\rceil$. Since $n \ge 2^d/2$, a random $n$-vertex subgraph of $Q_{d,2k}$ will have, with positive probability, average degree at least $k$. Let $G$ be such a graph. Then the maximum degree of $G$ is still at most $6k$, its girth is at least $\log n/(6k)^2$, and it is easy to see that every $t$-vertex subgraph $H$ of $G$ with $t \ge \log n/(6k)^2$ satisfies $\mathrm{sep}_E(H) \le (t/\log t)(\log\log t)^2$. This completes the proof. □


4 Proof of Theorem 2 

In the first part of the proof we will show that for every $t$-vertex subgraph $H$ of $G$ with $t \ge \frac12\log n/(6k)^2$, and every $1/t \le \mu \le 2/3$, there is a subset $A \subseteq V(H)$ of size $\mu t/3 \le |A| \le \mu t$ satisfying

\[ \phi_H(A) \le \frac{\log(1/\mu)}{\log t}(4\log\log t)^2 . \tag{10} \]

In the second part of the proof we will show that if $t \ge \log n/(6k)^2$ (as guaranteed by the statement) and $1/t \le \mu \le 2/3$ then there are in fact $1/4\mu$ mutually disjoint subsets of the same size and edge expansion only slightly larger than in (10).

Before we begin, we collect some useful facts. One can easily verify that the theorem’s assump¬ 
tion on k implies that t > y/\ogn (> 2^®),^^ and therefore A{H) < | log log log n < log log t. Notice 
we may assume $\mu \ge 1/\sqrt{t}$, as otherwise (10) trivially holds by taking $A$ to be any single vertex, in which case $\phi(A) \le \Delta(H) \le \log\log t \le (\log(1/\mu)/\log t)(4\log\log t)^2$, as needed. For the rest of the proof we set

\[ f(x) = (1/\log x)\cdot(\log\log x)^2 . \]

One can check that $f(x)$ is decreasing for $x \ge 256$. Throughout the proof, unless otherwise mentioned, we write $\phi(\cdot) = \phi_H(\cdot)$ for the edge expansion in $H$, and $\partial(\cdot) = \partial_H(\cdot)$ for the edge boundary in $H$.

In order to obtain a subset $A \subseteq V(H)$ satisfying (10) we will iteratively find smaller and smaller subsets, such that in each step the edge expansion (in $H$) of our subset does not increase by much. Formally, we claim that for any set $S \subseteq V(H)$ with $|S| \ge \sqrt{t}$ ($\ge 256$) there is a subset $S_1 \subseteq S$ of size $\frac13|S| \le |S_1| \le \frac23|S|$ such that $\phi(S_1) \le \phi(S) + 2f(|S|)$. To see this, consider an edge separator in the induced subgraph $H[S]$. Namely, let $S = S' \cup S''$ be a partition of $S$ with $|S'|,|S''| \le \frac23|S|$ such that the number of edges between $S'$ and $S''$ in $H[S]$, and thus in $H$, is $\mathrm{sep}_E(H[S])$. Consider now the edge boundary of $S'$ and $S''$ in $H$. We have

\[ |\partial(S')| + |\partial(S'')| = |\partial(S)| + 2\,\mathrm{sep}_E(H[S]) . \]

Moving to edge expansion, observe that this means that

\[ \min\{\phi(S'),\phi(S'')\} \le \frac{|\partial(S')| + |\partial(S'')|}{|S'| + |S''|} = \frac{|\partial(S)| + 2\,\mathrm{sep}_E(H[S])}{|S|} \le \phi(S) + 2f(|S|) , \]

where in the left inequality we used the elementary inequality $(a+b)/(c+d) \ge \min\{a/c,b/d\}$, and in the right inequality we used the fact that $\mathrm{sep}_E(H[S]) \le |S|f(|S|)$. This last inequality follows from item (iii) of Theorem 1 when $|S| \ge \log n/(6k)^2$, while when $|S| < \log n/(6k)^2$ it follows from item (ii) of Theorem 1, Fact 1.2 and the fact that $f$ is decreasing, as these facts together mean


^72k^ < (log log log n)^ < ^/logn-, for the last inequality, note the statement of the theorem implies logn > 2^ 
that $\mathrm{sep}_E(H[S]) \le \Delta(H) \le \log\log t \le \sqrt{t}f(t) \le |S|f(|S|)$. Altogether we see that we can take either $S'$ or $S''$ as the set $S_1$. Applying this claim with $S = S_0 = V(H)$ yields a subset $S_1 \subseteq S_0$ of size $\frac13|S_0| \le |S_1| \le \frac23|S_0|$ satisfying $\phi(S_1) \le \phi(S_0) + 2f(|S_0|) = 2f(|S_0|)$, where here we use the fact that $S_0 = V(H)$ so $\phi(S_0) = 0$. Having picked $S_i$, for some $i \ge 1$, we can then pick a new set $S_{i+1} \subseteq S_i$ satisfying $\frac13|S_i| \le |S_{i+1}| \le \frac23|S_i|$ and $\phi(S_{i+1}) \le \phi(S_i) + 2f(|S_i|)$. We stop the process once we obtain a subset $A \subseteq V(H)$ of size $|A| \le \mu t$. Note that this means that $|A| \ge \mu t/3$. Let $V(H) = S_0 \supseteq S_1 \supseteq \cdots \supseteq S_s = A$ denote the subsets we thus obtain. We get from the above observations that

\[ \phi(A) \le \sum_{i=0}^{s-1}2f(|S_i|) \le s\cdot 2f(\mu t) , \]

where in the second inequality we used the fact that $f(x)$ is decreasing. Since the number $s$ of iterations is at most $\lceil\log_{3/2}(1/\mu)\rceil \le 2\log_{3/2}(1/\mu)$, we get

\[ \phi(A) \le 2\log_{3/2}(1/\mu)\cdot 2f(\sqrt{t}) \le \frac{\log(1/\mu)}{\log t}\cdot 16(\log\log t)^2 , \]

where in the first inequality we used the assumption that $\mu t \ge \sqrt{t}$. This completes the first part of the proof.
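
The bound on the number $s$ of iterations is a simple geometric count: each step keeps at most a $2/3$ fraction of the current set, and the process stops as soon as the size is at most $\mu t$. The Python sketch below (our own illustration, with hypothetical function names) simulates the slowest possible shrinkage in exact rational arithmetic and compares the step count with $\lceil\log_{3/2}(1/\mu)\rceil$.

```python
from fractions import Fraction
from math import ceil, log

def worst_case_iterations(mu):
    """Steps until the size drops to at most mu*t when every step keeps exactly a 2/3 fraction."""
    ratio, steps = Fraction(1), 0        # current size as a fraction of t
    while ratio > mu:
        ratio *= Fraction(2, 3)
        steps += 1
    return steps

def iteration_bound(mu):
    """The bound ceil(log_{3/2}(1/mu)) used in the proof (evaluated in floating point)."""
    return ceil(log(1 / float(mu)) / log(1.5))

for mu in (Fraction(1, 2), Fraction(1, 10), Fraction(1, 100)):
    assert worst_case_iterations(mu) <= iteration_bound(mu)
```

Steps that shrink the set faster than by a factor of $2/3$ can only reduce the count, so the slowest schedule is the relevant one for an upper bound.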

Having found a single non-expanding subset $A$ satisfying (10), we proceed to find at least $1/4\mu$ disjoint subsets of size in $[\mu t/3,\mu t]$ having small expansion, as required by the theorem’s statement. So let $H = (V,E)$ be a $t$-vertex subgraph with $t \ge \log n/(6k)^2$ and suppose $1/t \le \mu \le 2/3$. Since $|V| \ge \frac12\log n/(6k)^2$, we can find a subset $A_1 \subseteq V$ satisfying $\phi_H(A_1) \le \psi(\mu,t)$ where $\psi(\mu,t)$ is the right hand side of (10). Now apply the same argument on the induced subgraph $H[V\setminus A_1]$ in order to find a subset $A_2 \subseteq V\setminus A_1$ of size in $[\mu t/3,\mu t]$ that satisfies $\phi_{H[V\setminus A_1]}(A_2) \le \psi(\mu,t)$. Note that we can pick sets $A_i \subseteq V\setminus\bigcup_{j<i}A_j$ at least $1/2\mu$ times (assume henceforth that $1/2\mu$ is an integer), since as long as we pick fewer sets we are left with at least $t/2 \ge \frac12\log n/(6k)^2$ vertices, so we can still pick another set. We therefore obtain at least $1/2\mu$ mutually disjoint subsets $A_1,\ldots,A_{1/2\mu}$ of size in $[\mu t/3,\mu t]$ each, satisfying

\[ \phi_{H[V\setminus\bigcup_{j<i}A_j]}(A_i) \le \psi(\mu,t) \tag{11} \]

for every $1 \le i \le 1/2\mu$. To finish the proof, we claim that at least half of the $1/2\mu$ subsets $A_i$ have small expansion in $H$ itself. To see this, note that

\[ \sum_{i=1}^{1/2\mu}|\partial_H(A_i)| \le 2\sum_{i=1}^{1/2\mu}|\partial_{H[V\setminus\bigcup_{j<i}A_j]}(A_i)| \le 2\sum_{i=1}^{1/2\mu}|A_i|\cdot\psi(\mu,t) \le t\,\psi(\mu,t) , \tag{12} \]

where in the first inequality we use the fact that each edge is counted (at most) twice, in the second inequality we use (11), and in the last inequality we use the fact that each $A_i$ is of size at most $\mu t$. Dividing both sides of (12) by $\mu t/3$ (which is a lower bound on the size of the sets $A_i$) we get that $\sum_{i=1}^{1/2\mu}\phi_H(A_i) \le 3\psi(\mu,t)/\mu$. Hence, by averaging, at least $1/4\mu$ of the sets $A_i$ satisfy

\[ \phi_H(A_i) \le 12\psi(\mu,t) \le (\log(1/\mu)/\log t)(14\log\log t)^2 , \]

thus completing the proof.

Footnote (on the choice of $A_2$): A subset $A_2 \subseteq V\setminus A_1$ of size at most $\mu t$ is therefore of size at most $\mu'|V\setminus A_1|$ with $\mu' \ge \mu$. Thus, we can find a subset $A_2$ satisfying $\phi(A_2) \le \psi(\mu',t) \le \psi(\mu,t)$ where we use the monotonicity of $\psi(\mu,t)$ with respect to $\mu$.

5 Missing Proofs


5.1 Proof of Lemma 3.5 

Proof of Lemma 3.5. Let $k$ denote the average degree of our graph $G = (V,E)$. Since $G$ is connected and not a tree we have $k \ge 2$. As is well known, $G$ must have a subgraph of minimum degree at least $\lfloor k/2\rfloor + 1 \ge 2$. In particular, this subgraph contains a cycle. Let $e = uv$ be any edge in such a cycle, and let $G'$ be obtained from $G$ by removing $e$. Note that our choice of $e$ guarantees that $\deg_{G'}(u),\deg_{G'}(v) \ge \lfloor k/2\rfloor$. We claim that the edge expansion of $G'$ is at least $\alpha/2$. Indeed, let $S \subseteq V$ with $e \in \partial_G(S)$. Then $|\partial_G(S)| \ge 2$, as otherwise $S$ forms a connected component of $G'$, contradicting the fact that $e$ is not a cut edge in $G$. Therefore, removing a single edge can decrease the edge expansion of $G$ by at most a factor of 2.

For a vertex $x \in V$ and a non-negative integer $r$, write $B_x(r) = \{y : \delta_{G'}(x,y) \le r\}$. Let $m_x(r)$ denote the number of edges spanned by $B_x(r)$. Since $G'$ has edge expansion at least $\alpha/2$, we have $m_x(r+1) \ge (1+\alpha/2)m_x(r)$ provided $|B_x(r)| \le n/2$. Put $R = (2/\alpha+1)(\ln(n)+1)$. We claim that $|B_u(R)| > n/2$. Indeed, recalling $m_u(1) = \deg_{G'}(u) \ge \lfloor k/2\rfloor$, we get that if $|B_u(R)| \le n/2$ then

\[ m_u(R+1) \ge (1+\alpha/2)^R\cdot\lfloor k/2\rfloor \ge e^{\ln(n)+1}\cdot\lfloor k/2\rfloor > nk/2 , \]

which is strictly greater than the number of edges of $G'$, a contradiction. Since we also have $\deg_{G'}(v) \ge \lfloor k/2\rfloor$ we deduce that $B_u(R) \cap B_v(R) \ne \emptyset$. This means that $\delta_{G'}(u,v) \le 2R$. Since $u$ and $v$ are adjacent in $G$ but not in $G'$, we conclude that $G$ has a cycle of length at most $2R+1 = 2(2/\alpha+1)(\ln(n)+1)+1$. □


5.2 Proof of Lemma 3.6 


Proof of Lemma 3.6. We iteratively remove subsets $S_1,\ldots,S_k \subseteq V(G)$ as follows. For the $i$-th step, consider the graph $G_i := G - \bigcup_{j=1}^{i-1}S_j$, and let $S_i \subseteq V(G_i)$ have size $|S_i| \le |V(G_i)|/2$ and expansion in $G_i$ at most $f(|V(G_i)|)$. We stop removing subsets once the set $S := \bigcup_{j=1}^{k}S_j$ is of size at least $n/3$, and thus $|S| \le \sum_{j=1}^{k-1}|S_j| + |V(G_k)|/2 = (n + \sum_{j=1}^{k-1}|S_j|)/2 \le 2n/3$. Now, observe that every edge in the edge boundary $\partial_G(S)$ is a member of some edge boundary $\partial_{G_j}(S_j)$. Hence, the expansion of $S$ in $G$ is

\[ \frac{|\partial_G(S)|}{|S|} \le \frac{\sum_{j=1}^{k}|\partial_{G_j}(S_j)|}{\sum_{j=1}^{k}|S_j|} \le \max_{1\le j\le k}\frac{|\partial_{G_j}(S_j)|}{|S_j|} \le \max_{1\le j\le k}f(|V(G_j)|) \le f(n/3) , \]

where in the second inequality we used the inequality $(\sum_{j=1}^{k}x_j)/(\sum_{j=1}^{k}y_j) \le \max_j(x_j/y_j)$ for non-negative reals $x_j,y_j$, and in the last inequality we used the fact that $|V(G_j)| \ge |V(G_k)| \ge n - |S| \ge n/3$ together with the fact that $f(t)$ is decreasing for $t \ge n/3$. It follows that $|\partial_G(S)| \le |S|f(n/3) \le (2/3)nf(n/3)$, completing the proof. □
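
The peeling procedure in this proof can be sketched as follows. This Python example is an illustration under our own assumptions: it picks, in every round, a sparsest set of at most half the remaining vertices by exhaustive search (one admissible choice of $S_i$; the proof allows any set of small enough expansion), so it is only usable on tiny graphs with $n \ge 3$. All function names are hypothetical.

```python
from itertools import combinations

def boundary(edges, s):
    """Number of edges with exactly one endpoint in s."""
    return sum((u in s) != (v in s) for u, v in edges)

def sparsest_set(vertices, edges):
    """Brute force: a nonempty set of size <= |V|/2 minimising |boundary(S)| / |S|."""
    best, best_ratio = None, float("inf")
    for size in range(1, len(vertices) // 2 + 1):
        for subset in combinations(sorted(vertices), size):
            ratio = boundary(edges, set(subset)) / size
            if ratio < best_ratio:
                best, best_ratio = set(subset), ratio
    return best

def peel_separator(n, edges):
    """Remove sparse sets S_1, S_2, ... until their union S has at least n/3 vertices."""
    remaining, live_edges = set(range(n)), list(edges)
    removed, pieces = set(), []
    while 3 * len(removed) < n:
        s = sparsest_set(remaining, live_edges)
        pieces.append((s, boundary(live_edges, s)))   # boundary measured in the current G_i
        removed |= s
        remaining -= s
        live_edges = [(u, v) for u, v in live_edges if u in remaining and v in remaining]
    return removed, pieces

# Example: the star with centre 8 and leaves 0..7 (n = 9).
star = [(8, leaf) for leaf in range(8)]
separator_side, pieces = peel_separator(9, star)
```

As in the proof, the union $S$ ends up with between $n/3$ and $2n/3$ vertices, and its boundary in $G$ is at most the sum of the boundaries recorded for the individual pieces.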

Footnote (to the proof of Lemma 3.6): We note that while the simple argument here suffices to bound the expansion of $\bigcup_i S_i$, bounding the expansion of each individual $S_i$ (which is done in the second part of the proof of Theorem 2) is trickier.

5.3 Proof of Lemma 2.5

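Proof of Lemma 2.5. Put $L = dI - A$ where $A$ is the adjacency matrix of $G = (V,E)$. Note that the $k$-th eigenvalue of $L$ is $d - \lambda_k$. By the well-known Courant-Fischer min-max theorem, we have

\[ d - \lambda_k = \min_{x_1,\ldots,x_k}\ \max_{0\ne x\in\mathrm{span}\{x_1,\ldots,x_k\}}R_L(x) , \tag{13} \]

where the minimum is over all collections of $k$ mutually orthogonal, nonzero vectors in $\mathbb{R}^V$ and $R_L(x) = x^TLx/x^Tx$ is the Rayleigh quotient. Let $S_1,\ldots,S_k \subseteq V$ be mutually disjoint, non-empty subsets, and denote by $1_{S_i}$ the characteristic $\{0,1\}$-vector of $S_i$. As it is easy to see that $x^TLy = \sum_{\{u,v\}\in E}(x_u-x_v)(y_u-y_v)$, we deduce that $1_{S_i}^TL1_{S_i} = |\partial(S_i)|$ and that for $i \ne j$ we have $1_{S_i}^TL1_{S_j} = -e(S_i,S_j)$ where $e(S_i,S_j)$ is the number of edges between $S_i$ and $S_j$. Consider now any vector $x \in \mathrm{span}\{1_{S_1},\ldots,1_{S_k}\}$ and write $x = \sum_{i=1}^{k}c_i1_{S_i}$. Then

\[ x^TLx = \sum_{i=1}^{k}c_i^2\cdot 1_{S_i}^TL1_{S_i} - \sum_{i\ne j}c_ic_je(S_i,S_j) \le \sum_{i=1}^{k}c_i^2|\partial(S_i)| + \sum_{i\ne j}\frac{c_i^2+c_j^2}{2}e(S_i,S_j) \]
\[ = \sum_{i=1}^{k}c_i^2|\partial(S_i)| + \sum_{i=1}^{k}c_i^2\sum_{j\ne i}e(S_i,S_j) \le 2\sum_{i=1}^{k}c_i^2|\partial(S_i)| . \]

The proof now follows since (13) implies

\[ d - \lambda_k \le \max_{0\ne x\in\mathrm{span}\{1_{S_1},\ldots,1_{S_k}\}}R_L(x) \le \frac{2\sum_i c_i^2|\partial(S_i)|}{\sum_i c_i^2|S_i|} \le 2\max_{1\le i\le k}\phi_G(S_i) . \]

□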
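
The two identities for indicator vectors used in this proof are easy to check on a small regular graph. The Python sketch below is our own illustration (function names are hypothetical, and the 3-cube with the sets $S_1 = \{0,1\}$, $S_2 = \{2,3\}$ is an arbitrary example): it evaluates the bilinear form $x^T(dI-A)y$ directly from the edge list and compares it with $|\partial(S_i)|$, with $-e(S_i,S_j)$, and with the bound $x^TLx \le 2\sum_i c_i^2|\partial(S_i)|$.

```python
def laplacian_form(d, edges, x, y):
    """Evaluate x^T (dI - A) y for a d-regular graph given by its edge list."""
    diagonal = d * sum(xv * yv for xv, yv in zip(x, y))
    off_diagonal = sum(x[u] * y[v] + x[v] * y[u] for u, v in edges)
    return diagonal - off_diagonal

def indicator(n, s):
    """Characteristic 0/1 vector of the vertex set s."""
    return [1 if v in s else 0 for v in range(n)]

# The 3-cube is 3-regular; vertices are 0..7 and neighbours differ in one bit.
n, d = 8, 3
edges = [(v, v | (1 << i)) for v in range(n) for i in range(d) if not v & (1 << i)]
s1, s2 = {0, 1}, {2, 3}
one_s1, one_s2 = indicator(n, s1), indicator(n, s2)

boundary_s1 = sum((u in s1) != (v in s1) for u, v in edges)
boundary_s2 = sum((u in s2) != (v in s2) for u, v in edges)
between = sum((u in s1 and v in s2) or (u in s2 and v in s1) for u, v in edges)

# A vector in span{1_{S1}, 1_{S2}} with coefficients c = (1, -1).
c = (1, -1)
x = [c[0] * a + c[1] * b for a, b in zip(one_s1, one_s2)]
quadratic = laplacian_form(d, edges, x, x)
upper = 2 * (c[0] ** 2 * boundary_s1 + c[1] ** 2 * boundary_s2)
```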


6 Concluding Remarks and Open Problems 

• As we mentioned in Section 1, we can replace the (log log t)^ term in ( 1 ) with (log log t)^'’''’^). 
It would be interesting to know whether the log log t factors in ( 1 ) (and thus also in (2)) 
can be removed. One way to obtain this is to improve Lemma 3.3. In this regard, we 
conjecture the following “higher-order” isoperimetric property of the hypercube: Among all 
subgraphs of the d-cube on 2* vertices, for every i G N at least d^/^ say, the subcubes have the 
largest normalized edge expansion. (One can also come up with an analogous isoperimetric conjecture for vertex expanders in the hypercube.) Since a subcube on $t = 2^i$ vertices has normalized edge expansion $1/i = 1/\log t$, proving the above conjecture would mean that the
additional logd factor in Lemma 3.3 is unnecessary for t > (note that without the lower 
bound on $t$, Lemma 3.3 is tight as witnessed by $K_{1,d}$). By the proof of Theorem 1, this would mean that the graphs $G_{n,k}$ are such that the $\log\log t$ terms in (1) and (2) can in fact be replaced by $O(k)$. Note that this is best possible, as witnessed by the subgraph $G_{n,k}$ itself. We note that this would essentially close the gaps in the applications mentioned in Section 2, as $k$ is chosen there to be either constant or to grow arbitrarily slowly with $n$. For example, this would imply a truly polynomial lower bound on the threshold rank in Corollary 2.4, thus proving that the eigenspace enumeration approach of [3] cannot solve $UG_q(\delta)$ in time $\exp(n^{o(1)})$ for constant $\delta,q$.

Footnote (on normalized edge expansion): I.e., edge expansion divided by the subgraph’s average degree.

• It is not hard to see that if every $t$-vertex subgraph $H$ of a graph $G$ satisfies $\mathrm{sep}_E(H) \le (t/\log t)(\log\log t)^c$ then $G$ must have at most $O(n(\log\log n)^{c'})$ edges. (Heuristically speaking, the upper bound on the number of edges admits a recursion of the form $f(n) \le 2f(n/2) + (n/\log n)(\log\log n)^c$.) We note that one can construct a graph $G$ satisfying this requirement using arguments similar to those we used to prove Theorem 1, and in this case, the resulting graph would have $\Omega(n(\log\log n)^{c''})$ edges. (To be more precise, $c' = c+1$ and $c'' = c-1$.) This means that the graph we construct in Theorem 1 has the maximum possible number of edges, up to $\log\log n$ factors.

The situation for vertex separators, as in Theorem 3, is not so clear, so it would be interesting to understand the maximum number of edges of $n$-vertex graphs in a hereditary family with vertex separators of size at most $(n/\log n)(\log\log n)^c$. Although we construct such a family $\mathcal{F}$ that contains $n$-vertex graphs with $\Omega(n(\log\log n)^{c''})$ edges, we cannot rule out the possibility of improving this to (say) $n\log n$.

• Let us mention two problems that seem to be related to the types of problems studied here 
but that (unfortunately) we cannot resolve. The first is a nice problem of G. Kalai. Suppose 
J-" is a hereditary family of graphs and that every graph in the family has a vertex separator of 
size re//(re). Then, how fast should /(re) grow so that J- has only non-isomorphic graphs 
on re vertices? It was shown in [11, 12] that /(re) > log^^'^re suffices while /(re) < log^~''re 
does not. Closing the gap between these bounds is still open.

The second problem is related to a graph decomposition result from a work of Pătraşcu and Thorup [25]. They proved that the edge set of every graph can be decomposed into $b = O(\log n)$ subsets $E_1,\ldots,E_b$ so that for every $1 \le i \le b$ the graph spanned by $E_1,\ldots,E_i$ has “edge expansion” $1/\log n$, where the notion of edge expansion used here is slightly different from the (standard) one we used throughout this paper. It would be interesting to decide whether the parameters in the construction of [25] are optimal.

Acknowledgment: We are grateful to Alex Samorodnitsky for very helpful discussions related to this work.